Page 1 of 5 






RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/898,238



DATE: 08/21/2001 
TIME: 08:22:03 



Input Set : A:\sequence.txt 

Output Set: N:\CRF3\08212001\l898238.raw 



SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: Wackett, Lawrence P. 

7 Sadowsky, Michael J. 

8 de Souza, Mervyn L. 

10 (ii) TITLE OF INVENTION: An Isolated and Purified DNA Molecule 

11 and Protein for the Degradation of Triazine Compounds 
13 (iii) NUMBER OF SEQUENCES: 2 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: Mueting, Raasch, Gebhardt & Schwappach, P.A.

17 (B) STREET: P.O. Box 581415 

18 (C) CITY: Minneapolis 

19 (D) STATE: MN

20 (E) COUNTRY: USA

21 (F) ZIP: 55458-1415

23 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

26 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

27 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30
29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/898,238

C--> 31 (B) FILING DATE: 03-Jul-2001

32 (C) CLASSIFICATION: 

34 (viii) ATTORNEY/AGENT INFORMATION: 

35 (A) NAME: Mueting, Ann M.

36 (B) REGISTRATION NUMBER: 33,977 

37 (C) REFERENCE/DOCKET NUMBER: 110.00230101 

39 (ix) TELECOMMUNICATION INFORMATION: 

40 (A) TELEPHONE: 612-305-1217 

41 (B) TELEFAX: 612-305-1228 
44 (2) INFORMATION FOR SEQ ID NO: 1:

46 (i) SEQUENCE CHARACTERISTICS:

47 (A) LENGTH: 1858 base pairs 

48 (B) TYPE: nucleic acid 

49 (C) STRANDEDNESS: single 

50 (D) TOPOLOGY: linear 

52 (ii) MOLECULE TYPE: DNA (genomic) 

54 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

56 CTCGGGTAAC TTCTTGAGCG CGGCCACAGC AGCCTTGATC ATGAAGGCGA GCATGGTGAC 60 

58 CTTGACGCCG CTCTTTTCGT TCTCTTTGTT GAACTGCACG CGAAAGGCTT CCAGGTCGGT 120

60 GATGTCCGCG TCGTCGTGGT TGGTGACGTG CGGGATGACC ACCCAGTTGC GGTGCAGGTT 180 

62 TTTCGATGGC ATAATATCTG CGTTGCGACG TGTAACACAC TATTGGAGAC ATATCATGCA 240

64 AACGCTCAGC ATCCAGCACG GTACCCTCGT CACGATGGAT CAGTACCGCA GAGTCCTTGG 300 

66 GGATAGCTGG GTTCACGTGC AGGATGGACG GATCGTCGCG CTCGGAGTGC ACGCCGAGTC 360 

68 GGTGCCTCCG CCAGCGGATC GGGTGATCGA TGCACGCGGC AAGGTCGTGT TACCCGGTTT 420

70 CATCAATGCC CACACCCATG TGAACCAGAT CCTCCTGCGC GGAGGGCCCT CGCACGGACG 480

72 TCAATTCTAT GACTGGCTGT TCAACGTTGT GTATCCGGGA CAAAAGGCGA TGAGACCGGA 540

74 GGACGTAGCG GTGGCGGTGA GGTTGTATTG TGCGGAAGCT GTGCGCAGCG GGATTACGAC 600

76 GATCAACGAA AACGCCGATT CGGCCATCTA CCCAGGCAAC ATCGAGGCCG CGATGGCGGT 660
78 CTATGGTGAG GTGGGTGTGA GGGTCGTCTA CGCCCGCATG TTCTTTGATC GGATGGACGG 720 
80 GCGCATTCAA GGGTATGTGG ACGCCTTGAA GGCTCGCTCT CCCCAAGTCG AACTGTGCTC 780
82 GATCATGGAG GAAACGGCTG TGGCCAAAGA TCGGATCACA GCCCTGTCAG ATCAGTATCA 840
84 TGGCACGGCA GGAGGTCGTA TATCAGTTTG GCCCGCTCCT GCCACTACCA CGGCGGTGAC 900 

86 AGTTGAAGGA ATGCGATGGG CACAAGCCTT CGCCCGTGAT CGGGCGGTAA TGTGGACGCT 960
88 TCACATGGCG GAGAGCGATC ATGATGAGCG GATTCATGGG ATGAGTCCCG CCGAGTACAT 1020
90 GGAGTGTTAC GGACTCTTGG ATGAGCGTCT GCAGGTCGCG CATTGCGTGT ACTTTGACCG 1080 
92 GAAGGATGTT CGGCTGCTGC ACCGCCACAA TGTGAAGGTC GCGTCGCAGG TTGTGAGCAA 1140
94 TGCCTACCTC GGCTCAGGGG TGGCCCCCGT GCCAGAGATG GTGGAGCGCG GCATGGCCGT 1200 
96 GGGCATTGGA ACAGATAACG GGAATAGTAA TGACTCCGCA AACATGATCG GAGACATGAA 1260 
98 GTTTATGGCC CATATTCACC GCGCGGTGCA TCGGGATGCG GACGTGCTGA CCCCAGAGAA 1320 
100 GATTCTTGAA ATGGCGACGA TCGATGGGGC GCGTTCGTTG GGAATGGACC ACGAGATTGG 1380 
102 TTCCATCGAA ACCGGCAAGC GCGCGGACCT TATCCTGCTT GACCTGCGTC ACCTCAGACG 1440
104 ACTCTCACAT CATTTGGCGG CCACGATCGT GTTTCAGGCT TACGGCAATG AGGTGGACAC 1500 
106 TGTCCTGATT GACGGAAACG TTGTGATGGA GAACCGCCGC TTGAGCTTTC TTCCCCCTGA 1560 
108 ACGTGAGTTG GCGTTCCTTG AGGAAGCGCA GAGCCGCGCC ACAGCTATTT TGCAGCGGGC 1620 
110 GAACATGGTG GCTAACCCAG CTTGGCGCAG CCTCTAGGAA ATGACGCCGT TGCTGCATCC 1680 
112 GCCGCCCCTT GAGGAAATCG CTGCCATCTT GGCGCGGCTC GGATTGGGGG GCGGACATGA 1740
114 CCTTGATGGA TACAGAATTG CCATGAATGC GGCACTTCCG TCCTTCGCTC GTGTGGAATC 1800 
116 GTTGGTAGGT GAGGGTCGAC TGCGGGCGCC AGCTTCCCGA AGAGGTGAAA GGCCCGAG 1858 
124 (2) INFORMATION FOR SEQ ID NO: 2: 



126 (i) SEQUENCE CHARACTERISTICS: 

127 (A) LENGTH: 473 amino acids 

128 (B) TYPE: amino acid 

129 (C) STRANDEDNESS: single

130 (D) TOPOLOGY: linear 
132 (ii) MOLECULE TYPE: protein 

134 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

136 Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln

137 1               5                   10                  15

139 Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg

140             20                  25                  30

142 Ile Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp

143         35                  40                  45

145 Arg Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn

146     50                  55                  60

148 Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His

149 65                  70                  75                  80

151 Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln

152                 85                  90                  95

154 Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys

155             100                 105                 110

157 Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp

158         115                 120                 125

160 Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly

161     130                 135                 140
VERIFICATION SUMMARY DATE: 08/21/2001
L:30 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:]
L:31 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]